Learning What's Easy: Fully Differentiable Neural Easy-First Taggers
Authors
Abstract
We introduce a novel neural easy-first decoder that learns to solve sequence tagging tasks in a flexible order. In contrast to previous easy-first decoders, our models are end-to-end differentiable. The decoder iteratively updates a “sketch” of the predictions over the sequence. At its core is an attention mechanism that controls which parts of the input are strategically the best to process next. We present a new constrained softmax transformation that ensures the same cumulative attention to every word, and show how to efficiently evaluate and backpropagate over it. Our models compare favourably to BiLSTM taggers on three sequence tagging tasks.
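The abstract does not define the constrained softmax, so the following is only a sketch under an assumed definition: a softmax whose outputs are capped by per-word upper bounds, with the excess mass redistributed proportionally among the uncapped words. The function names, the simple active-set loop, and the fixed scores in `cumulative_attention` are illustrative assumptions, not the authors' implementation; in particular, the efficient evaluation and backpropagation mentioned in the abstract are not reproduced here.

```python
import math


def constrained_softmax(scores, upper):
    """Assumed definition: p_i = min(upper_i, exp(scores_i) / Z), with sum(p) == 1.

    Requires upper_i >= 0 and sum(upper) >= 1, otherwise no solution exists.
    """
    if len(scores) != len(upper):
        raise ValueError("scores and upper must have the same length")
    if sum(upper) < 1.0 - 1e-9:
        raise ValueError("upper bounds must sum to at least 1")

    shift = max(scores)  # subtract the max for numerical stability
    weights = [math.exp(s - shift) for s in scores]
    probs = [0.0] * len(scores)
    free = set(range(len(scores)))  # indices not yet capped
    mass = 1.0                      # probability mass left for free indices

    while free:
        total = sum(weights[i] for i in free)
        # Free indices whose proportional share would exceed their bound.
        violators = [i for i in free if mass * weights[i] / total > upper[i]]
        if not violators:
            for i in free:
                probs[i] = mass * weights[i] / total
            break
        for i in violators:
            probs[i] = upper[i]  # cap at the bound
            mass = max(mass - upper[i], 0.0)
            free.remove(i)
    return probs


def cumulative_attention(scores, steps):
    """Hypothetical demo: attend `steps` times with bounds 1 - attention so far."""
    cumulative = [0.0] * len(scores)
    for _ in range(steps):
        bounds = [max(1.0 - c, 0.0) for c in cumulative]
        attention = constrained_softmax(scores, bounds)
        cumulative = [c + a for c, a in zip(cumulative, attention)]
    return cumulative
```

Under this assumed definition, running as many attention steps as there are words forces the cumulative attention of every word to 1: each step distributes a total mass of 1, and no word can exceed its bound. For example, `cumulative_attention([2.0, 1.0, 0.0], 3)` returns values that are all approximately 1, even though the scores strongly favour the first word.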
Related resources
Two Novel Learning Algorithms for CMAC Neural Network Based on Changeable Learning Rate
The Cerebellar Model Articulation Controller (CMAC) neural network is a computational model of the cerebellum that acts as a lookup table. The advantages of CMAC are fast learning convergence and the capability of mapping nonlinear functions, owing to its local generalization of weight updating, single structure and easy processing. In the training phase, the disadvantage of some CMAC models is unstable phenomenon...
Auto-Differentiating Linear Algebra
Development systems for deep learning, such as Theano, Torch, TensorFlow, or MXNet, are easy-to-use tools for creating complex neural network models. Since gradient computations are automatically baked in, and execution is mapped to high performance hardware, these models can be trained end-to-end on large amounts of data. However, it is currently not easy to implement many basic machine learning...
FlexTag: A Highly Flexible PoS Tagging Framework
We present FlexTag, a highly flexible PoS tagging framework. In contrast to monolithic implementations that can only be retrained but not adapted otherwise, FlexTag enables users to modify the feature space and the classification algorithm. We categorize existing PoS tagger implementations into one of three categories with regard to model-training capabilities and the level of access those imp...
Numerical Coordinate Regression with Convolutional Neural Networks
We study deep learning approaches to inferring numerical coordinates for points of interest in an input image. Existing convolutional neural network-based solutions to this problem either take a heatmap matching approach or regress to coordinates with a fully connected output layer. Neither of these approaches is ideal, since the former is not entirely differentiable, and the latter lacks inher...
APPLICATION OF NEURAL NETWORK IN EVALUATION OF SEISMIC CAPACITY FOR STEEL STRUCTURES UNDER CRITICAL SUCCESSIVE EARTHQUAKES
Depending on tectonic activity, most buildings are subjected to multiple earthquakes, whereas most seismic design codes suggest a single design earthquake. Perhaps because of the lack of easy assessment of second-shock information, and sometimes the use of inappropriate methods for estimating these features, successive earthquakes have mostly been ignored in the analysis procedure. In order to overcome t...
Publication date: 2017